PATENT PAT-008/A 

SCRIPT COMPLIANCE AND QUALITY ASSURANCE USING SPEECH 

RECOGNITION 

Cross-Reference to Related Applications 

This application is a continuation-in-part of pending US patent application Serial Number 
5 09/785,048, filed on 15 February 2001 , entitled "SCRIPT COMPLIANCE USING SPEECH 
RECOGNITION". The benefit of the filing date of such application is hereby claimed to the 
fullest extent permitted under 35 U.S.C. § 120, and the text of such application is incorporated 
herein by this reference as if set forth herein verbatim. 
Brief Description of the Drawings 
10 Fig. 1 is a block diagram demonstrating aspects of a communications system. 

Fig. 2 is a block diagram showing a call center implementation of the described 
communications system. 

Fig. 3 is a block diagram of a scripting package for use in the described communications 
system and methods. 

15 Fig. 4 is a block diagram of a quality assurance logging process and quality assurance 

method. 

Fig. 5 is a block diagram showing a number of call center actions forming part of the 
communications system and methods. 

Figure 6 is a data flow diagram of additional aspects of the invention suitable for 
20 identifying further forms of agent non-compliance with scripts governing interactions involving 
such agents. 

Detailed Description of the Preferred Embodiments 
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PATENT PAT-008/A 
Overview of the Invention 

Apparatus and methods are provided for using automatic speech recognition technology 
to analyze a voice interaction and verify compliance of an agent reading a script to a client 
during the voice interaction. The apparatus and methods are particularly suited for use in any 
5 situation where a voice interaction takes place in which at least one participant is obliged to 
follow a prepared script, and are particularly suited for use in the operation of a call center, such 
as, for example, to evaluate or verify that call center agents are properly reciting scripts during 
telephone or web-based calls to or from call center customers. 

In one aspect, a communications system includes a voice communications network 

10 providing voice connectivity between a system user and a call center. The call center preferably 
includes a call control device for receiving and routing calls, one or more agent workstations at 
which an agent is able to process an incoming or outgoing call, and a script compliance module 
for analyzing a voice interaction between the system user and the agent. The system user is able 
to access the communications system with any type of voice communications device, including, 

15 for example, a telephone, a voice-capable computer, or a wireless communications device. The 
voice communications network is provided with any form of voice communications capability 
needed to support the user's voice communications device, such as a digital communications 
network, standard telephone network, internet-based, or wireless network. The call control 
device provides the functions of receiving the voice communication from the communications 

20 network and routing the call to the agent workstation. The agent workstation will typically 

include a telephone and a computer, with the computer being optionally networked to a database 
for data access by the agent. 
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The script compliance module is provided with an automatic speech recognition (ASR) 
component, such as that provided by a speaker-independent, continuous speech, multilingual, 
multi-dialect ASR component such as those known in the art. The ASR component is adapted to 
receive a digital signal representing a voice interaction between the system user and the agent, 
5 and to provide an output of an analysis of the digital signal for use in a quality assurance (QA) 
process. 

In another aspect, a method is provided for analyzing a voice interaction and verifying 
compliance of an agent reading a script to a client during the voice interaction, for example, as 
part of a telemarketing campaign. The voice interaction preferably takes place between a system 

10 user and an agent over the communications network, but may alternatively be a face-to-face 
voice interaction or any voice interaction capable of being captured and analyzed by an ASR 
component. The agent may be physically located within the call center, or may be at a distant 
location, but the voice interaction is preferably routed through the call control device at the call 
center. In the preferred embodiment, the agent is responsible for referring to and following a 

15 prepared script for at least a portion of the voice interaction. The voice interaction is captured, 
converted to digital form, and exposed to the ASR component, in real-time or in a recorded form, 
and the ASR component analyzes at least a portion of the voice interaction. The analyzed 
portion is compared against a standard, preferably the expected content from the prepared script 
or script portion associated with the given portion of the voice interaction, and a determination is 

20 made concerning the extent to which the agent complied with the script during the voice 

interaction. For example, one or more portions of the voice interaction may be assigned a score 
to indicate a level of script compliance by the agent, as determined by the ASR component, and 
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taking into account any limitations (e.g., confidence-level thresholds) in the ASR component's 
ability to evaluate the voice interaction. 

In yet another aspect, one or more actions are taken based upon the above script 
compliance determination. In a preferred embodiment, these actions are taken as part of a 
5 quality assurance or employee incentive program. The actions include, for example, sending the 
voice interaction to a quality assurance monitor for review, assigning the agent for random voice 
interaction review, sending an e-mail or other flag to an oversight authority for review, sending a 
voice or text message to the agent, updating a file associated with the agent, updating an 
incentive program to reflect the compliance determination, or other such actions. 

10 In yet another aspect, a scripting package and quality assurance process are constructed to 

provide panel-level review of a voice interaction during the quality assurance process. The 
scripting package preferably includes a plurality of call scripts used by the agent during the voice 
interaction, a log record layout including provision for each value logged during the voice 
interaction, and a plurality of ASR reference texts corresponding with the plurality of call scripts. 

15 The voice interaction is recorded and logged, including a timestamp and time displacement for 
each script panel occurring during the voice interaction. The quality assurance process includes 
a provision for retrieving and reviewing the recorded voice interaction by panel level. 
Accordingly, if a script compliance scoring system is used, the score may be retrieved and 
reviewed for each panel forming a part of the voice interaction without having to review the 

20 entire voice interaction. 

Several advantages are obtained through use of the apparatus and methods so described. 
For example, the described apparatus and method provide a script compliance function having a 
wide range and scope of applications at a relatively minor expense when compared to non- 
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automated management systems. By employing an ASR component to analyze and evaluate the 
voice interactions, a call center provider can decrease or avoid the need to have individual 
managers or other call reviewers perform those functions. This becomes particularly 
advantageous to call centers having several agents, perhaps dozens or hundreds, or where the 
5 agents are not physically located on the call center premises. 

A further advantage obtained by the present apparatus and methods is the ability to 
provide useful information concerning agent script compliance to a quality assurance (QA) 
authority in a time-effective manner. For example, when the apparatus and methods are used in 
real-time, a report may be submitted automatically to a QA authority almost immediately after a 
10 given voice interaction is completed. Where the voice interaction is recorded and reviewed later, 
time delays may still be minimized. In addition, near instantaneous feedback may be given to an 
agent to attempt to minimize problems with script compliance. 

A still further advantage of the described systems and methods is the provision of panel- 
level playback and review of a voice interaction in the quality assurance process. This provides 
15 more effective and efficient methods of quality assurance in, for example, a call center operation. 

Other and further advantages are described below and still others will be apparent from a 
review of the descriptions contained herein. 

The communications systems and script compliance methods may optionally include 
additional, or fewer, features and functionality than those described herein for the preferred 
20 embodiments while still obtaining the benefits described. The inventions described herein are 
not limited to the specific embodiments described, or to the specific equipment, features, or 
functionality described for the apparatus and methods of the examples contained herein. These 
examples are provided to illustrate, but not to limit the inventions described. 
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The preferred embodiments include several aspects generally directed to voice 
communications apparatus and methods, several of which are described below. The primary 
preferred embodiment is a script compliance apparatus and method particularly adapted for use 
in a call center, and most particularly in a telemarketing application. While this embodiment is 
5 described in detail herein, it will be understood by those skilled in the art that other and further 
aspects and applications are possible. For example, the systems and methods may be adapted for 
use in call centers for applications other than telemarketing, or for voice interactions not 
associated with call centers or telemarketing operations. The following description is not 
intended to limit the scope of the described inventions, which are instead set forth in the 
10 appended claims. 

Detailed Description 

Figure 1 shows a block diagram of one implementation of the apparatus and methods of 
these inventions. The diagram in Figure 1 reflects aspects of a call center implementation, 
though it will be understood that the various structures and functionalities may be extended to 

15 other implementations, including face-to-face voice interactions, electronic commerce, 

telephone, web, or wireless-based based information services, and the like. The communications 
system shown in Figure 1 includes a user interface 10, a communications network 12, and a call 
center 14, each described in further detail below. 

The user interface 10 provides the function of allowing a system user, such as a 

20 telemarketing customer, to conduct a voice communication with a telemarketing services 
provider. The user interface 10 may be a standard function telephone, a video telephone, a 
wireless communication device, an internet-based communication device, or other instrument 
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adapted to support voice communication. In the preferred embodiment, the user interface is a 
standard telephone. 

The communications network 12 provides the function of transmitting a voice signal 
between the user interface and the call center. Accordingly, the communications network 12 
5 may include an analog or digital telephone network, an internet-based network, a wireless 
network, or any voice communications supporting network. The communications network 12 
supports voice communications between a system user using the user interface communication 
device and, in the preferred embodiment, the call center 14. In the preferred embodiment, the 
communications network is a standard telephone service network provided by a long distance 

10 and/or local service carrier such as AT&T, Sprint, MCI, or others. 

The call center 14 serves as a call termination and servicing point, and may be provided 
having any number of features, functions, and structures. In the typical call center, a call control 
component is provided to automatically receive and route calls to one or more telemarketing 
agents working at agent workstations within the call center. An agent workstation may include 

15 only a telephone, but it is typically provided with a networked computer and terminal used to 
support the agent functions. For example, a central database containing customer information 
and information relating to goods, services, or other offerings being provided by the telemarketer 
is typically provided and is accessible by the computers and terminals located at the agent 
workstations. When a telemarketing call is being processed, information relating to that call 

20 (e.g., customer identification information, product offerings information, credit card information, 
etc.) are automatically sent by the central database to the agent terminal in a "screen pop." The 
agent then reads information from the computer terminal as the call is processed, and enters new 
information as it is obtained during the call. 
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Figure 2 shows additional details of the call center 14 and, in particular, an embodiment 
representing an inbound call center. The call center 14 includes a programmable switch 16 that 
operates to receive incoming calls and to provide an interface for access to calls, call data, and 
other call center operations. The call center also preferably contains an automatic call distributor 
5 (ACD) 18 for routing calls to agents according to pre-determined criteria. While these primary 
functions of the switch and ACD are described, other details and functions of these devices are 
generally known in the art, and will not be discussed here. 

Three agent workstations 20a-c are shown in the call center in Figure 2. It is possible to 
have any number of agent workstations at the call center or, alternatively, to provide off-site 

10 agents that are able to access the call center remotely by another voice communications network 
not shown. In the case of an inbound telemarketing campaign using multiple agents, the switch 
and ACD cooperate to route calls to the appropriate location where an agent is able to process 
the calls. The agent workstation also includes a computer terminal at which data may be 
accessed by the agent. Typical call centers utilize computer-telephone integration (CTI) in 

15 which telephone number information (automatic number identification (ANI) or dialed number 
information service (DNIS)) is associated with other customer information stored on a database 
that is then accessed in real-time during a telemarketing call and a "screen pop" containing this 
information occurs at the agent workstation terminal. Additional information concerning the 
goods, services, or other offerings is also provided to the agent workstation terminal. A central 

20 computer 22 is shown in Figure 2 having a network connection to each of the agent workstations, 
and a connection to the switch to obtain caller information from the incoming call. The details of 
the central computer and network are beyond the scope of the present inventions, and are 
therefore not discussed further here. Moreover, it is typical to provide a call center with other 
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features and functions desired for a given call center application. Although these additional 
features and functions are not explicitly described herein, those skilled in the art will recognize 
that they may be added to the described system consistent with the needs of the given 
application. 

In a particular preferred form, data is provided to the agent workstations during calls in a 
series of "panels", with each panel being associated with a particular script or portion of a script. 
The scripts are prepared as a part of a telemarketing campaign, and include the information 
needed to be given to the customer in a form intended to be effective and efficient to achieve its 
purpose. In particular, in a typical campaign, a telemarketer strives to obtain the most efficient 
result in the shortest transaction time in order to decrease on-line costs. The scripts are, 
therefore, typically highly-developed and tested to determine their effectiveness. A 
telemarketing campaign can be significantly undermined by an agent's failure to closely follow a 
script. 

In addition, by presenting script information in panel form, a quality assurance process 
may preferably be coordinated with the scripting process to provide panel-level playback. This 
panel-level playback, as opposed to the need to play back and/or navigate through an entire 
telemarketing voice interaction to review a certain portion of it, is a significant advantage 
provided by the described system. 

Accordingly, a script compliance module 24 is included in the call center. The script 
compliance module 24 is a software package that is shown in Figure 2 as having an interface 
with the central computer, but its location within the call center is optional, as long as access is 
available to the digitized voice interaction. The script compliance module 24 performs several 
functions within the call center, as set forth in more detail below. The script compliance module 



PATENT PAT-008/A 

includes an advanced speech recognition (ASR) component whereby a voice interaction between 
a customer and an agent may be analyzed and evaluated for compliance with an expected 
standard. As discussed below, the script compliance module may be constructed to operate in 
real-time, i.e., as the voice interaction takes place, or, preferably, it may include a recording 
5 capability such that voice interactions are reviewed and evaluated at a later time. 

The ASR component of the script compliance module is supported by providing an 
appropriate ASR software package. These ASR software packages are commercially available, 
and examples include those available from Nuance Communications (Menlo Park, California) 
and Speechworks International, Inc. (Boston, Massachusetts). A detailed description of speech 

10 recognition technology is not necessary to understand the systems and methods described herein. 
Briefly, however, the ASR component is adapted to capture a voice signal and convert it to 
digital form (if not presented to the ASR component in digital form already). The digital signal 
is then converted to a spectral representation that undergoes an analysis to match the spectral 
representation to a written vocabulary, and converts the voice signal to written text. Currently 

15 available systems are able to analyze continuous, multi-lingual, multi-dialect speech from in a 
speaker-independent manner and convert it to its corresponding text form. 

As noted, the script compliance module 24 may be adapted to operate in real-time by 
including a component for converting the voice interaction to digital form for direct analysis by 
the ASR software package. In that case, the voice interactions are preferably captured live and 

20 fed directly to the digital converter and the ASR software package for analysis. Optionally, the 
script compliance module 24 may be adapted to analyze recorded voice interactions. In 
particular, and preferably, the script compliance module 24 or other system component may 
include one of the commercially available audio recording and monitoring systems such as those 
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available from NICE Systems Ltd. or Witness Systems, Inc. In such a case, the voice interaction 
recorded by the audio recording and monitoring system may supply audio files to the ASR 
software package for analysis. Because recordings of the voice interactions may be useful to a 
call center administrator for other purposes, related or not to script compliance, the preferred 
5 embodiment includes a voice interaction recording component such as those described above. 

The script compliance module 24 preferably includes a scripting package 26, discussed in 
more detail below. The scripting package 26 is depicted graphically in Figure 3, and includes the 
following components: 

First, one or more call scripts 28 are provided. The call scripts 28 may be maintained in 

10 the script compliance module, or, preferably, they may be maintained on the central computer 
and accessible by the script compliance module. The call scripts 28 are accessed during the 
voice interaction and contain the information to be read by the agent to the customer during the 
voice interaction. As noted above, the call scripts 28 are preferably presented in separate panels 
containing discrete portions of the overall call script. As an agent progresses through a call, the 

15 agent moves from a first panel, to a second, to a third, and so on. A single offer of a good or 
service may be contained on a single panel, or on several panels. Alternatively, several offers 
may be presented during a single call. 

Second, a log record layout module 30 is provided. A log record is preferably created for 
each voice interaction taking place at the call center. The log record layout includes data fields 

20 for all data that could be captured during calls, and log records are maintained as part of the 

ongoing function of the call center. The data fields will, of course, vary based upon the 

operation of the call center. Typical data fields will include date and time of call, length of call, 

agent identity, customer identity, and any transaction data obtained during the call. Some data 
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fields may be filled automatically during a call, such as date, time, agent identity, and the like, 
while others may be filled by the agent during the call. 

Third, an ASR text module 32 is provided. The ASR text is a reference text to be used by 
the ASR component of the script compliance module, and corresponds to the call scripts 
5 described above. As with the call scripts, the ASR text is preferably provided in separate panels. 
Fourth, a set of action rules 34 is provided. In the most general sense, the action rules 
take the output of the ASR component evaluation of the voice interaction and, based thereon, 
direct an action to be taken by another component of the script compliance module. The output 
of the ASR component evaluation may comprise, for example, a numerical score indicating the 

10 degree to which the voice interaction complied with the ASR text. The actions directed by the 
set of action rules may comprise, for example, a quality assurance (QA) action to be taken based 
upon the numerical score. For example, scores less than 60 may be sent to a QA authority for 
review, scores between 60 and 80 may have random calls selected for review by a QA authority, 
and scores over 80 may be used to drive a QA incentive program. These are examples only. The 

15 determination of specific standards and actions will depend, of course, on the type of application. 

Fifth, a panel timestamp logging feature 36 is provided. The panel timestamp logging 
feature assigns a time displacement timestamp to each panel as it is presented and viewed by an 
agent during a voice interaction with a customer. For example, in a voice interaction in which a 
first panel is processed in 15 seconds and a second panel is processed in 12 seconds, the first 

20 panel will log from 0:00:00 to 0:00:15 (i.e., the duration of the voice interaction relating to the 
first panel) and the next panel will log from 0:00:16 to 0:00:27. This progression continues for 
each panel used during the voice interaction. A log of the timestamps is maintained for each 
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voice interaction. The timestamps are then preferably used in the quality assurance process to 
facilitate panel-level playback of the voice interaction. 

The communications system operation will now be described in reference to Figure 4, 
and in the context of a telemarketing call. A telemarketing agent and a customer engage in a 
5 voice interaction during which the agent processes the call 40, i.e., the agent reads from scripts 
presented on the workstation terminal and enters information in the fields provided according to 
responses obtained from the customer. As noted above, the scripts are preferably presented to 
the agent in panels, with each panel corresponding to a portion of the overall script, or to a 
separate script. The time displacement per panel is logged 42 as a portion of the log record. 

10 Once the call is completed 44, all data obtained during the call is logged according to the log 

record layout. If a voice recording or video recording is made, they too are logged and stored for 
later use in the QA process. 

The QA process 46 is next begun by retrieving the voice interaction record. The log 
record is also retrieved and reviewed to determine which scripts were to have been recited by the 

15 agent, and the corresponding ASR texts are retrieved for the ASR analysis. The voice and/or 

video recording is preferably divided into panel-level segments 48 for review and evaluation, and 
the log record is evaluated 50 to determine the expected ASR text by panel. A comparison of the 
voice interaction with the ASR text is then performed by the ASR component in order to 
determine the degree of compliance of the voice interaction with the ASR text. In the preferred 

20 embodiment, the ASR component assigns scores 52 based upon the level of accuracy of the 

comparison. Confidence-level thresholds are used in evaluating the match accuracy. After each 
panel is evaluated and scored, an overall score may be determined. The panel-level scores and 
overall scores are next used to determine any action 54 to be taken as provided in the pre- 
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determined set of action rules. Examples of such actions include sending an e-mail containing 
the file for review, providing a feedback message to the agent, or other actions tailored to the 
particular application. 

As an extension of the QA process, the stored voice interaction and log records may be 
5 retrieved from the system by a QA authority at a later time for additional analysis. The records 
may be used to review the assigned panel-level and/or overall compliance scores. In addition, all 
or a portion of the voice and/or video recording may be played back for analysis. The logging 
process included in the scripting package allows panel-level playback of the voice interaction 
either in conjunction with, or independent from the ASR analyzing function of the system. 

10 A block diagram providing an additional representation of the call center actions is 

shown in Figure 5. The ASR Interface 56 is used to set the initial conditions of the ASR 
component of the script compliance module. The initial conditions of the ASR component 
include the definitions of the ASR texts 58, the definitions of the evaluation conditions 60 - i.e., 
the point in time during a voice interaction a given ASR text is expected to be read - and the 

15 action rules 62, discussed above. Any changes or modifications to the initial conditions are 
made by accessing these features via the ASR Interface 56 and making the desired changes. 

When a call is processed 64, a voice recording is made 66 and, optionally, a video 
recording 68 is made. Each of these recordings may be separately logged and stored for later 
retrieval as needed. A log record 70 is created of the voice interaction during the call and is 

20 used, along with the ASR initial conditions, to build an expected speech list 72 to which the 

voice recording will be compared. For example, as a call is processed, the agent will view, read 
from, and enter information into several panels according to the nature and flow of the call. The 
interactive logic concerning all branching of the scripts and panels provided to the agent during 
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the call is maintained on the central computer or, alternatively, in the script compliance module, 
and dictates which call scripts are presented to the agent at each step of the call. The evaluation 
conditions contain the information coordinating the voice interaction, scripts, panels, and ASR 
texts. These are used to build the expected speech list. 
5 The actual voice recording is then compared 74 to the ASR text to determine compliance. 

A score is generated 76 indicating the measured compliance, taking into account the confidence- 
level thresholds of the ASR component, and the score is evaluated 78 against pre-determined 
standards. The pre-determined standards may be static or may vary, and may be included in the 
ASR evaluation conditions. For example, an 80% accuracy score may be sufficient for one 

10 script or script portion, but a 90% accuracy score may be required for another script or portion. 
The score and evaluation may be added to a report 80 of the call for later retrieval. An action 82 
is next taken based upon the score according to the pre-determined set of action rules. Examples 
of these actions include e-mailing a report (which may optionally include a copy of the digital 
recording of the voice interaction) to a QA authority 84, providing a feedback message directly 

15 to the agent 86, or any other 88 action appropriate for the given application. 

Figure 6 is a data flow diagram of additional aspects of the invention suitable for 
identifying further forms of agent non-compliance with scripts governing interactions involving 
such agents. These other aspects of the invention provide various methods, systems, and 
computer-readable media readable by one or more processors that are capable of evaluating 

20 compliance of one or more agents with one or more script that governs, at least in part, one or 
more interactions processed by the agent or agents. One illustrative but non-limiting 
embodiment of the method comprises at least creating one or more voice records or data 602 
(which may be stored in a data store 604) of one or more interactions processed by the agent, 
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defining at least first data relating to evaluating compliance of the agent with the one script 
(stored, for example, in a data store 612), and processing a given voice record 605 against given 
data comparison 613. 

The method can also include identifying one or more instances of agent non-compliance 
5 with the script based upon the processing of the voice record 605 against comparison data 613. 
These instances of agent non-compliance with the script can then be processed further, as 
detailed below. 

The voice records data store 604 can be created to store data 602 resulting from or 
relating to one or more interactions, such as telephone calls or other types of interactions handled 

10 by an agent physically located in a call center, which can be either an in-house call center run by 
a given company or firm, or a third-party call center to which such companies or firms might 
outsource call-handling functions. Conversely, the voice records data store 602 can be created to 
store data 602 resulting from or relating to interactions handled by an agent physically located in 
a location remote from a call center. Such agents are known as remote agents, or if the agent is 

15 working from his or her residence, as a "home agent" or "home-based agent". In the context of 
home- or remotely-based agents, such agents may not be not subject to immediate, in-person 
supervision, so the need to proactively detect and investigate script compliance issues of the type 
discussed herein can be particularly acute with these agents, perhaps even more so than with call 
center-based agents who are supervised in-person. Finally, the invention as described herein can 

20 be deployed at a call center that either receives incoming or inbound calls or that generates or 
originates outgoing or outbound calls. 

Turning in more detail to the comparison data 613 from the data store 612, this first data 
612 can relate to script compliance and may be embodied in a list or other suitable data structure 
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that includes or contains one or more words 610 or other verbiage that also appear in the script 
that governs interactions handled by the agent(s). For such words, systems or processors 606 
configured according to the invention may focus on ensuring that these words 610 from the 
script appear in the voice record data 605 for a given interaction. These words 610 may appear 
5 at key portions within the script, for example in portions of the script pertaining to conversion(s), 
upsell(s), or the like. In these instances, it may be important from a contractual, quality- 
assurance, or other standpoint to ensure that the agent read these key portions of the script fully 
and accurately. By searching for certain key words 610 in the voice record 605, the invention 
can facilitate automatic processing of voice records 602 of agent interactions to ensure that the 

10 agent complied with the script, especially these key portions of the script. As discussed above, 
speech recognition engines can be deployed to process the given voice record 605 as taught 
above to facilitate analysis of the voice record 605 against comparison data 613, which in this 
aspect of the invention, takes the form of the words or verbiage 610 from the script. The output 
of this analysis is a compliance score 614, which in this aspect of the invention indicates whether 

15 the agent read at least the key portions of the script. 

In contrast to or in connection with ensuring that script text 610 appears in the voice 
record, the invention can also include ensuring that the agent did not speak or employ any 
extraneous, unexpected, or improper verbiage 608 during the interaction. To support this 
function, the invention can use data in the form of a list that contains one or more specifically 

20 enumerated words or phrases 608 that do not appear in the script, and are not expected to appear 
in the voice records 605 of given interactions. In these aspects of the invention, the comparison 
data 613 takes the form of the verbiage 608, as retrieved from the data store 612. These specific 
words or phrases 608 can include profane or abusive language, or other unexpected or improper 
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verbiage as may be recognized by those skilled in the art. The compliance store 614 would 
reflect the presence of such unexpected verbiage 608 in the voice record 605. 

In the case of profanity or abuse by the agent, the real-time aspects of the invention 
discussed above can be very useful for identifying such profanity or abuse in real-time after it 
5 occurs so follow-up action with the agent (either a call center agent or a home agent) can be 
taken quickly before other interactions are affected by the same agent. Illustrative but non- 
limiting examples of follow-up action can include disciplining the agent, disconnecting the 
agent, terminating employment of the agent, routing or directing calls away from the agent, or 
any other action suitable under such circumstances as recognized by those skilled in the call 
10 center management arts. 

In other aspects, the invention can realize a similar goal by processing or comparing the 
verbiage contained in a given voice record 605 with the verbiage 610 in the script governing the 
interaction captured in that voice record 605. The output 614 of this comparison, revealing a 
compliance score in the form of the verbal "difference" between the two, can indicate any excess 
15 verbiage spoken by the agent that was not dictated by the script, and may further identify any 
extraneous, unexpected, or improper verbiage spoken by the agent that was not or could not be 
expressly enumerated and stored in a pre-defined list such as verbiage 608. Having identified 
such surplus or excess language, the invention can identify or flag this language for further more 
detailed review by either a live supervisor or an automated process. This feature can provide a 
20 further, more flexible capability to police agent conduct while handling interactions. 

In light of this teaching, agent non-compliance can take several forms, including 
determining that one or more words 610 contained in a predefined list are not contained in a 
given voice record 605. Here, the comparison data 613 takes the form of this verbiage 610. In 
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one aspect, if the predefined list contains "target words" 610 corresponding to conversions, 
upsells, or the like, and the voice record 605 for a given interaction does not contain these "target 
words" 610, then the agent may be in non-compliance on this given interaction. In another 
aspect, if the predefined list contains "forbidden words" 608 (one non-limiting example of the 
5 verbiage 608) corresponding to abuse, profanity, or the like, and the voice record 605 for a given 
interaction does not contain these "forbidden words", then the agent may be in compliance on 
this given interaction. However, if "forbidden words" 608 are in the predefined list and are not 
in the script 610, but are in the voice record 605, then the agent may be in non-compliance for 
using such words 608. Finally, if a comparison of the script 610 governing a given interaction 

10 and the voice record 605 for the given interaction reveals surplus or excess language that is not 
on a predefined "forbidden word" list 608, this surplus or excess language can be highlighted for 
further review to determine whether the agent is in non-compliance. 

Having identified non-compliance in any of the above forms, a specific instance of agent 
non-compliance can be processed by entering data representing the instance of agent non- 

15 compliance into a data store, for example, one that supports further follow-up by a supervisor, 
that is related to a compliance incentive program, or the like. 

Those skilled in the art will further recognize that for the purposes of convenience, 
legibility, and clarity, Figure 6 illustrates the various data stores 604 and 610 separately, and they 
are discussed separately herein. However, the embodiment shown in Figure 6 is illustrative 

20 rather than limiting, and some or all of these various data stores could readily be combined, 

separated, or consolidated into one or more data stores without departing from the scope of the 
invention. 
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A suitable application program can implement software residing on the computer- 
readable medium or media readable and executable by the processor 606 and embodying the 
various aspects of the method discussed herein and shown in the drawing figures, and can be 
coded using any suitable programming or scripting language. However, it is to be understood 
5 that the invention as described herein is not dependent on any particular operating system, 
environment, or programming language. Illustrative operating systems include without 
limitation LINUX, UNIX, or any of the Windows™-family of operating systems, and illustrative 
languages include without limitation a variety of structured and object-oriented languages such 
as C, C++, Visual Basic, or the like. Likewise, the invention as described herein is not 

10 dependent on any particular make or model of processor 606. Instead, any commercially 

available microprocessor or central processing unit (CPU) may be suitable, depending on the 
circumstances of a given application of the invention. 

As those skilled in the art will understand, the program of instructions can be loaded and 
stored onto a program storage medium or device readable by the processor 606 or other machine, 

15 embodying a program of instructions executable by the processor 606 to perform the various 
aspects of the invention as discussed and claimed herein, and as illustrated in the drawing 
figures. Generally speaking, the program storage medium can be implemented using any 
technology based upon materials having specific magnetic, optical, semiconductor or other 
properties that render them suitable for storing computer-readable data, whether such technology 

20 involves either volatile or non- volatile storage media. Specific examples of such media can 

include, but are not limited to, magnetic hard or floppy disks drives, optical drives or CD-ROMs, 

and any memory technology based on semiconductors or other materials, whether implemented 

as read-only or random access memory. In short, this embodiment of the invention may reside 
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either on a medium directly addressable by the computer's processor (main memory, however 
implemented) or on a medium indirectly accessible to the processor (secondary storage media 
such as hard disk drives, tape drives, CD-ROM drives, floppy drives, or the like). Consistent 
with the above teaching, program storage device can be affixed permanently or removably to a 
5 bay, socket, connector, or other hardware provided by the cabinet, motherboard, or other 
component of a given computer system. 

Those skilled in the art will also understand that a computer programmed in accordance 
with the above teaching using known programming languages provides suitable means for 
realizing the various functions, methods, and processes as described and claimed herein and as 

10 illustrated in the drawing figure attached hereto. 

Those skilled in the art, when reading this description, will understand that unless 
expressly stated to the contrary, the use of the singular or the plural number herein is illustrative, 
rather than limiting, of the instant invention. Accordingly, where a given term is discussed in the 
singular number, it will be well understood that the invention also contemplates a plural number 

15 of the item corresponding to the given term and vice versa, unless expressly stated herein to the 
contrary. 

Various embodiments of the invention are described above to facilitate a thorough 
understanding of various aspects of the invention. However, these embodiments are to be 
understood as illustrative rather than limiting in nature, and those skilled in the art will recognize 
20 that various modifications or extensions of these embodiments will fall within the scope of the 
invention, which is defined by the appended claims. 

The foregoing cited references, patents and publications are hereby incorporated herein 
by reference, as if fully set forth herein. Although the foregoing invention has been described in 
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some detail by way of illustration and example for purposes of clarity and understanding, it may 
be readily apparent to those of ordinary skill in the art in light of the teachings of this invention 
that certain changes and modifications may be made thereto without departing from the spirit or 
scope of the appended claims. 
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